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1  Introduction 

In  multi-agent  domains,  agents  can  often  benefit  by  coordinating  their  actions  with  one 
another;  in  some  domains,  this  coordination  is  actually  required.  In  two-agent  encounters, 
the  situation  is  relatively  simple:  either  the  agents  reach  an  agreement  (i.e.,  coordinate  their 
actions),  or  they  do  not.  With  more  than  two  agents,  however,  the  situation  becomes  more 
complicated,  since  agreement  may  be  reached  by  sub-groups. 

The  process  of  agent  coordination,  and  of  reaching  agreement,  has  been  the  focus  of 
much  research  in  Distributed  Artificial  Intelligence  (DAI).  The  general  term  used  for  this 
process  is  "negotiation"  (usually  in  the  2-agent  case)  [1,  6,  8,  9,  17,  21,  11],  and  "reaching 
consensus"  (in  the  n-agent  case)  [2,  3].  Both  approaches,  though  dealing  with  different 
numbers  of  agents,  share  one  underlying  assumption:  the  agreement,  if  it  is  reached,  will 
include  all  relevant  members  of  the  encounter.  Thus,  even  in  the  n-agent  case  where  a  voting 
procedure  might  enable  consensus  to  be  reached,  the  entire  group  will  be  bound  by  the  group 
decision.  Sub-groups  cannot  make  sub-agreements  that  exclude  other  members  of  the  group. 
Interesting  variations  on  these  approaches,  which  nonetheless  remain  bilateral  in  essence, 
are  the  Contract  Net  [16],  which  allows  bilateral  agreement  in  n-agent  environments,  and 
bilateral  negotiation  among  two  sub-groups  discussed  in  [7]. 

In  some  domains,  however,  it  may  be  possible  for  beneficial  agreements  to  be  reached 
among  sub-groups  of  agents,  who  might  be  individually  motivated  to  work  together  to  the 
exclusion  of  others  outside  the  group.  Voting  procedures  are  not  applicable  here,  because 
the  full  coalition  may  not  be  able  to  satisfy  all  its  members,  who  are  free  to  create  more 
satisfying  sub-coalitions.  This  paper  considers  this  more  general  case  of  n-agent  coalition 
formation  (recent  pieces  of  work  on  similar  topics  are  [5,  15]).  Building  on  our  previous 
work  [21],  which  dealt  only  with  bilateral  negotiation  mechanisms,  we  here  analyze  the 
kinds  of  n-agent  coordination  mechanisms  that  can  be  used  in  specific  classes  of  domains. 

2  Coalitions 

2.1      An  Example — The  Tileworld 

Consider  the  following  simple  example  in  a  multi-agent  version  of  the  Tileworld  [10]  (see 
Figure  1).  A  single  hole  in  the  grid  is  represented  by  a  framed  letter  (such  as  [a]).  Each 
agent's  position  is  marked  by  its  name  (such  as  Ai).  Tiles  are  represented  by  black  squares 
(■)  inside  the  grid  squares. 

Agents  can  move  from  one  grid  square  to  another  horizontally  or  vertically  (unless  the 
square  is  occupied  by  a  hole— multiple  agents  can  be  in  the  same  grid  square  at  the  same 
time).  When  a  tile  is  pushed  into  any  grid  square  that  is  part  of  a  hole,  the  square  is  filled 
and  becomes  navigable  as  if  it  were  a  regular  grid  square.  The  domain  is  static  except  for 
changes  brought  about  by  the  agents.  I — , 

Agent  I's  goal  is  to  fill  hole  [^,  while  agents  2  and  3  need  to  fill  holes  [bj  and  [c] 
respectively.  To  fill  its  hole,  each  agent  needs  to  do  7  steps.     Agents  can  cooperate  and  help 
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Figure  1:  Three- Agent  Encounter  in  the  Tileworld 

each  other  to  reduce  the  cost  of  achieving  their  goals.  There  are  several  kinds  of  joint  plans 
that  the  agents  can  execute  that  will  reduce  the  cost  of  achieving  their  goals.  Some  of  those 
joint  plans  are  listed  in  the  table  on  the  right  side  of  Figure  1. 

The  coalition  structure  {1,3}, {2}  means  that  there  are  two  coalitions,  one  consisting  of 
the  agents  1  and  3,  and  the  other  consisting  only  of  agent  2.  When  two  agents  form  a 
coalition  it  means  that  they  are  coordinating  their  actions.  The  utility  of  an  agent  from  a 
joint  plan  that  achieves  his  goal  is  the  difference  between  the  cost  of  achieving  his  goal  alone 
and  the  cost  of  his  part  of  the  joint  plan  [20]. 

The  coalition  that  gives  the  maximal  total  utility  is  the  full  coalition  that  involves  all 
3  agents,  where  they  all  coordinate  their  actions  to  mutual  benefit  (total  utility  is  17).^ 
Although  this  full  coalition  is  globally  optimal,  Agent  I's  utility  is  only  4,  and  he  would 
prefer  to  reach  agreement  with  either  agent  2  or  agent  3  (with  utility  of  6),  but  not  with 
both. 

The  agents  in  the  above  scenario  are  able  to  transfer  utility  to  each  other,  but  in  a 
non-continuous  way.  Agent  1,  for  example,  can  "transfer"  to  agent  2  seven  points  of  utility 
by  achieving  his  goal.  He  cannot,  however,  transfer  an  arbitrary  amount.  Without  this 
arbitrary,  continuous  utility  transfer  capability,  agent  1  will  prefer  to  form  a  coalition  with 
either  one  of  the  other  two  agents,  rather  than  with  both. 


2.2      Coalition  Games 

The  definitions  below  are  standard  ones  from  coalition  theory  [4]. 

Definition  1  A  coalition  game  with  transferable  utility  in  normal  characteristic  form  is 
{N,v)  where:  N  =  {l,2,...,n}  set  of  agents,  and  v:2^  — »  IR.  For  each  coalition  which  is 
a  subset  of  agents  S  C  N,  v{S)  is  the  value  of  the  coalition  S,  which  is  the  total  utility  that 
the  members  of  S  can  achieve  by  coordinating  and  acting  together. 

The  joint  plan  where  agent  1  achieves  both  2  and  3's  goals  (with  cost  of  3),  while  either  agent  2  or  3 
achieves  I's  goal  (each  with  expected  cost  of  |). 
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Table  1:  Possible  Coalitions  in  the  Tileworld  Example 

The  Tileworld  example  from  Figure  1  can  be  described  as  a  coalition  game  (jV,  u)  such 
that:  u({l})  =  v{{2})  =  v{{3})  =  i;({2,3})  =  0,  i;({l,2})  =  t;({l,3})  =  12,  and  ^({1,2,3})  = 
17. 

Note  that  the  value  derived  by  a  coalition  is  independent  of  the  coalition  structure.  A 
given  coalition  is  guaranteed  to  get  a  certain  utility,  regardless  of  what  coalitions  are  formed 
by  the  other  agents.  In  the  Tileworld  domain  this  assumption  is  not  necessarily  true — 
though  it  is  true  in  the  example  we  gave  above.  We  will  see  below  that  in  Task  Oriented 
Domains  [21]  this  definition  of  the  coalition  value  is  directly  applicable. 


3      Task  Oriented  Domains 

Definition  2  A  Task  Oriented  Domain  (TOD)  is  a  tuple  <T,A,c>  where:  T  is  the  set 
of  all  possible  tasks;  A  =  {Ai, .  . .  An}  is  an  ordered  list  of  agents;  c  is  a  monotonic  function 
c:  [2^]  ->  1R+.  [2^]  stands  for  all  the  finite  subsets  of  T .  For  each  finite  set  of  tasks  X  CT, 
c{X)  is  the  cost  of  executing  all  the  tasks  in  X  by  a  single  agent,  c  is  monotonic,  i.e.,  for 
any  two  finite  subsets  X  CY  CT,  c{X)  <  €{¥);  c(0)  =  0. 

An  encounter  within  a  TOD  <T,A,c>  is  an  ordered  list  (Tj, . . .  ,r„)  such  that  for  all 
k  e  {l...n],Tk  is  a  finite  set  of  tasks  from  T  that  Ak  needs  to  achieve.  Tk  will  also  be  called 
Ak  's  goal. 

The  Postmen  Domain  [19]  is  one  classic  example  of  a  TOD.  In  this  domain,  each  agent  is 
given  a  set  of  letters  to  deliver  to  various  nodes  on  a  graph;  starting  and  ending  at  the  Post 
Office,  the  agents  are  to  traverse  the  graph  and  make  their  deliveries.  Agents  can  reach 
agreements  to  carry  one  another's  letters,  and  save  on  their  travel. 

In  multi-agent  Task  Oriented  Domains,  agents  can  reach  agreements  about  the  re-distribution 
of  tasks  among  themselves.  When  there  are  more  than  two  agents,  the  agents  can  also  form 
coalitions  such  that  tasks  are  re-distributed  only  among  the  members  of  the  same  coalition. 
When  mixed  deals  are  being  used  by  agents  (those  are  agreements  where  agents  settle  on 
a  probabilistic  distribution  of  tasks),  it  can  be  useful  to  conceive  of  the  interaction  as  a 
coalition  game  with  transferable  utility.  The  use  of  probability  smooths  the  discontinuous 
distribution  of  tasks,  and  therefore  of  utility.     However,  utility  is  still  not  money  in  a  classic 


TOD;  utility  is  the  difFerencp  between  ike  cmX  j?f  -achieving  your  goal  alone,  and  the  cost 
of  your  part  of  the  deal.  Therefore,  tl^ere  "is  an  upper  bound  on  the  amount  of  utility  that 
each  agent  can  get — no  agent  can  get  more  utility  than  his  stand-alone  cost.  As  we  shall  see 
below,  however,  our  model  never  attempts  to  violate  this  upper  bound  on  utility. 

3.1  Subadditive  Task  Oriented  Domains 

In  some  domains,  by  combining  sets  of  tasks  we  may  reduce  (and  can  never  increase)  the 
total  cost,  as  compared  with  the  sum  of  the  costs  of  achieving  the  sets  separately.  The 
Postmen  Domain,  for  example,  has  this  property,  which  is  called  subadditivity.  If  X  and 
Y  are  two  sets  of  addresses,  and  we  need  to  visit  all  of  them  {X  U  Y),  then  in  the  worst 
case  we  will  be  able  to  do  the  minimal  cycle  visiting  the  X  addresses,  then  do  the  minimal 
cycle  visiting  the  Y  addresses.  This  might  be  our  best  plan  if  the  addresses  are  disjoint  and 
decoupled  (the  topology  of  the  graph  is  against  us).  In  that  case,  the  cost  of  visiting  all  the 
addresses  is  equal  to  visiting  one  set  plus  the  cost  of  visiting  the  other  set.  However,  in  some 
cases  we  may  be  able  to  do  better,  and  visit  some  addresses  on  the  way  to  others. 

Definition  3  TOD  <  T,A,c  >  will  be  called  subadditive  if  for  all  finite  sets  of  tasks 
X,  Y  C  T,  we  have  c{X  U  K)  <  c{X)  +  c{Y). 

3.2  Coalitions  in  Subadditive  Task  Oriented  Domains 

In  a  TOD,  a  group  of  agents  (a  coalition)  can  coordinate  by  redistributing  their  tasks  among 
themselves.  In  a  subadditive  TOD,  the  way  to  minimize  total  cost  is  to  aggregate  as  many 
tasks  as  possible  into  one  execution  batch  (since  the  cost  of  the  union  of  tasks  is  always  less 
than  the  sum  of  the  costs).  Therefore,  the  maximum  utility  that  a  group  can  derive  in  a 
subadditive  TOD  is  the  difference  between  the  sum  of  stand-alone  costs  and  the  cost  of  the 
overall  union  of  tasks.  This  difference  will  be  defined  to  be  the  value  of  the  coalition. 

Definition  4  Given  an  encounter  {Ti,...,Tn)  in  a  subadditive  TOD  <  T,A,c  >,  we  will 
define  the  coalition  game  induced  by  this  encounter  to  be  (N,v),  such  that  N  =  {1,2, ..  .  ,n}, 
and  V5  C  N,  v{S)  =  E.6S  c(r.)  -  c(U.^s  T,). 

3.3  Superadditive  Coalition  Games 

It  seems  intuitively  reasonable  that  agents  in  a  coalition  game  should  not  suffer  by  coordinat- 
ing their  actions  with  a  larger  group.  In  other  words,  if  you  take  two  disjoint  coalitions,  the 
utility  they  can  derive  together  should  not  be  less  than  the  sum  of  their  separate  utilities  (at 
the  worst,  they  could  "coordinate"  by  ignoring  each  other).  This  property  (which,  however, 
is  not  always  present)  is  called  superadditivity. 

Definition  5  A  coalition  game  with  transferable  xdility  in  normal  characteristic  form  {N,v) 
is  superadditive  if  for  any  disjoint  coalitions  S,V  C  N,  S  D  V  =  ib,  then  v{S)  +  v{V)  < 
v{SUV). 


Theorem  1   Any  encounter  (Tj, . . . ,  r„)  in  a  subadditive  TOD  induces  a  superadditive  coali- 
tion game  {N,v). 

Proof.  Proofs  can  be  found  in  [22].         o 


4      Mechanisms  for  Subadditive  TODs 

We  would  like  to  set  up  rules  of  interaction  such  that  communities  of  self-interested  agents 
will  form  beneficial  coalitions.  There  are  several  attributes  of  the  rules  of  interaction  that 
might  be  important  to  the  designers  of  these  self-interested  agents  (as  discussed  further 
in  [12]): 

1.  Efficiency:  The  agents  should  not  squander  resources  when  they  come  to  an  agreement; 
there  should  not  be  wasted  utility  when  an  agreement  is  reached.  Since  the  coalition  game 
is  superadditive  it  means  that  the  sum  of  utilities  of  the  agents  should  be  equal  to  v{N). 

2.  Stability:  Since  the  coalition  game  is  superadditive,  the  full  coalition  can  always  satisfy 
the  efficiency  condition,  and  therefore  we  will  assume  that  the  full  coalition  will  be  formed. 
The  stability  condition  then  relates  to  the  payoff  vector  (ui,...,u„)  that  assigns  to  each 
agent  i  a  utility  of  u,.  There  are  three  levels  of  stability  (rationality)  conditions:  individual, 
group,  and  coalition  rationality.  Individual  Rationality  means  that  that  no  individual  agent 
would  like  to  opt  out  of  the  full  coalition;  i.e.,  m  >  v{{i])  =  0.  Group  Rationality  (Pareto 
Optimality)  means  that  the  group  as  a  whole  would  not  prefer  any  other  payoff  vector  over 
this  vector;  i.e.,  Er=i  ".  -  ^(")-  This  condition  is  equivalent  to  the  efficiency  condition 
above.  Coalition  Rationality  means  that  no  group  of  agents  should  have  an  incentive  to 
deviate  from  the  full  coalition  and  create  a  sub-coalition;  i.e.,  for  each  subset  of  agents 
SCN,Z,esUi>v{S). 

3.  Simplicity:  It  will  be  desirable  for  the  overall  interaction  environment  to  make  low 
computational  demands  on  the  agents,  and  to  require  little  communication  overhead. 

4.  Distribution:  Preferably,  the  interaction  rules  will  not  require  a  central  decision  maker, 
for  all  the  obvious  reasons.  We  do  not  want  our  distributed  system  to  have  a  performance 
bottleneck,  nor  collapse  due  to  the  single  failure  of  a  special  node. 

5.  Symmetry:  Two  symmetric  agents  should  be  assigned  the  same  utility  by  the  mechanism 
(two  agents  are  symmetric  when  they  contribute  exactly  the  same  value  to  all  possible 
coalitions). 

We  will  develop  a  mechanism  for  subadditive  TODs  such  that  agents  agree  on  the  all- 
or-nothing  deal,  in  which  each  agent  has  some  probability  of  executing  all  the  tasks.  The 
question  that  we  will  try  to  answer  now  is  "What  should  be  the  division  of  utilities  among 
all  agents  in  the  full  coalition?" 

Coalition  rationality  is  the  strongest  stability  condition,  and  implies  individual  rationality 
and  group  rationality.^  However,  this  condition  is  very  strong,  and  cannot  always  be  satisfied. 
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Figure  2:  Example  of  an  Unstable  Encounter 

Consider  the  encounter  from  a  three-agent  Postmen  Domain  that  can  be  seen  in  Figure  2. 

The  Post  Office  is  in  the  center.  The  length  of  each  arch  is  1.  The  encounter  is  (Ti  = 
{a,d},T2  =  {6,  e},  T3  =  {c,  f}).^  Each  agent  can  deliver  his  letters  with  a  cost  of  4.  The  cost 
of  delivering  the  union  of  the  letters  of  any  two  agents  is  5.  Therefore,  the  value  of  any  two 
agents'  coalition  is  (2  *  4)  -  5  =  3.  The  cost  of  delivering  all  the  letters  is  8.  Therefore,  the 
value  of  the  full  coalition  is  (3  *  4)  -  8  =  4.  We  would  like  to  find  a  payoff  vector  {111,112-,  U3) 
that  satisfies  the  following  conditions: 

(1)  V^G  {1,2,3}  u,>v{{i})  =  0; 

(2)  Vi  ^  J  6  {1,2,  3}  u.  +  u,  >  v{{z,j})  =  3; 

(3)  U1  +  U2  +  U3  >  i;({l,2,3})  =  4. 

Since  the  full  coalition  is  also  the  maximal  valued  configuration,  condition  (3)  is  satisfied 
by  equality  (i.e.,  Ui+U2+"3  =  4).  If  we  add  up  all  the  inequalities,  we  will  have  uj+ua+us  >  = 
4:^,  which  cannot  be  satisfied.  This  means  that  in  any  division  of  the  value  of  the  full  coalition 
among  the  agents  there  will  be  at  least  two  agents  that  will  prefer  to  opt  out  of  the  coalition 
and  form  a  sub-coalition!  For  example,  assume  that  the  full  coalition  is  formed  with  payoff 
vector  (1,1,2).  Agents  1  and  2  can  get  more  by  forming  a  coalition  (i.e.,  by  excluding  agent 
3  from  the  coalition).  The  new  payoff  vector  can  then  be  (l|,li,0).  This  coalition  and 
payoff  vector  is  also  not  stable,  since  now  agent  3  can  tempt  agent  2  (for  example)  to  form 
a  coalition  with  3  by  promising  2  more  utility.  The  new  payoff  vector  can  then  be  (0,2,1). 
However,  now  agent  1  can  convince  the  two  agents  that  they  all  can  do  better  by  forming 
the  full  coalition  again.  The  new  payoff  vector  can  then  be  (|,2|,  l|).  This  coalition  is  also 
not  stable. . . 


-AH  payoffs  that  satisfy  the  coalition  rationality  conditions  are  called  the  core  of  the  game  in  the  game 
theory  literature.  See,  for  example,  [4]. 

Agent  1  has  to  deliver  letters  to  addresses  a  and  d,  agent  2  has  to  deliver  letters  to  addresses  6  and  e, 
and  agent  3  has  to  deliver  letters  to  addresses  c  and  /. 


4.1  Shapley  Value 

The  Shapley  Value  [14,  18]  for  agent  i  is  a  weighted  average  of  all  the  utilities  that  i  con- 
tributes to  all  possible  coalitions.  The  weight  of  each  coalition  is  the  probability  that  this 
coalition  will  be  formed  in  a  random  process  that  starts  with  the  one-agent  coalition,  and 
in  which  this  coalition  grows  by  one  agent  at  a  time  such  that  each  agent  that  joins  the 
coalition  is  credited  with  his  contribution  to  the  coalition.  The  Shapley  Value  is  actually 
the  expected  utility  that  each  agent  will  have  from  such  a  random  process  (assuming  any 
coalition  and  permutation  is  equally  likely). 

Definition  6  Given  a  superadditive  coalition  game  with  transferable  utility  in  normal  char- 
acteristic  form  {N,v),  the  Shapley  Value  is  defined  to  be:  u,  =  YlscNiiS  n>        ''v{S  U 

{z})-v{S). 

The  Shapley  Value  satisfies  the  efficiency,  symmetry,  and  individual  rationality  condi- 
tions [14,  4].  However,  it  does  not  necessarily  satisfy  the  coalition  rationality  condition. 

Theorem  2  The  Shapley  Value  is  also:  u,  =  c{T,)  -  E5cAf,;g5  ^""'^L?'''^''^'c(-^)-  ^cl"?)  = 
c{S  U  {i})  —  c{S),  i.e.,  the  additional  cost  that  agent  i  adds  to  a  coalition  S. 

Agent  I's  Shapley  Value  is  the  difference  between  the  cost  of  its  goal  and  its  weighted 
average  cost  contribution  to  all  possible  coalitions.  The  cost  that  agent  i  can  contribute  to  a 
coalition  is  bounded  by  c{T,).  Therefore,  the  average  contribution  is  also  bounded  by  c{T,), 
which  also  means  that  the  Shapley  Value  is  positive  (i.e.,  satisfies  the  individual  rationality 
contribution)  and  bounded  by  c{T,)  (which  is  also  the  maximal  utility  that  an  agent  can 
get  according  to  our  model).  Thus  (as  we  promised  above  in  Section  3),  our  model  never 
attempts  to  transfer  to  an  agent  more  utility  than  he  can  get  by  simply  having  his  tasks 
performed  by  others. 

4.2  Mechanisms  for  Subadditive  TODs 

We  can  define  a  Shapley  Value-based  mechanism  for  subadditive  TODs  that  forms  the 
full  coalition  and  divides  the  value  of  the  full  coalition  using  the  Shapley  Value.  The 
mechanism  simply  chooses  the  following  (all-or-nothing)  mixed  deal,  (pi, . . .  ,p„),  such  that 

Theorem  3    The  above  all-or-nothing  deal  is  well-defined,  (i.e.,  ^i  ^  N:Q  <  Pi  <\\  E"=i  Pi  = 
\)  and  gives  each  agent  i  an  expected  utility  that  is  exactly  the  Shapley  Value  u,. 


4.3      Evaluation  of  the  Mechanism 

The  above  mechanism  gives  each  agent  its  Shapley  Value.  The  mechanism  is  thus  symmetric 
and  efficient  (i.e.,  satisfying  group  rationality),  and  also  satisfies  the  criterion  of  individual 
rationality.  However,  as  was  seen  in  Example  2,  no  mechanism  can  guarantee  coalition  ratio- 
nality. Besides  failing  to  guarantee  coalition  rationality,  the  mechanism  also  does  not  satisfy 
the  simplicity  condition.  It  requires  agents  to  calculate  the  Shapley  Value,  a  computation 
that  has  exponential  computational  complexity. 

The  computational  complexity  of  a  mechanism  should  be  measured  relative  to  the  com- 
plexity of  the  agent's  standalone  planning  problem.  This  relative  measurement  would  then 
signify  the  computational  overhead  of  the  mechanism.  Each  agent  in  a  Task  Oriented  Do- 
main needs  to  calculate  the  cost  of  his  set  of  tasks,  i.e.,  to  find  the  best  plan  to  achieve  them. 
Calculation  of  the  value  of  a  coalition  is  linear  in  the  number  of  agents  in  the  coalition.'*  The 
calculation  of  the  Shapley  Value  requires  an  evaluation  of  the  value  of  all  (2")  possible  coali- 
tions. In  Section  6  below  we  will  show  that  there  exists  another  Shapley-based  mechanism 
that  has  linear  computational  complexity. 

5      Concave  TODs 

Definition  7  [Concavity]:^    TOD  <  T,A,c>  will  be  called  concave  if  for  all  finite  sets 
of  tasks  X  C  r,  Z  CT,  we  have  c{Y  U  Z)  -  c{Y)  <  c{X  U  Z)  -  c{X). 

All  concave  TODs  are  also  subadditive.  It  turns  out  that  general  subadditive  Task  Ori- 
ented Domains  can  be  restricted,  becoming  concave  Task  Oriented  Domains.  For  example, 
the  Postmen  Domain  is  subadditive,  when  the  graphs  over  which  agents  travel  can  assume 
any  topology.  By  restricting  legal  topologies  to  trees,  the  Postmen  Domain  becomes  concave. 

Definition  8  A  coalition  game  with  transferable  utility  in  normal  characteristic  form  {N,v) 
is  convex  if  for  any  coalitions  5,  V,  v(S)  +  viV)  <  v(S  U  V)  4-  v{S  D  V). 

In  convex  coalition  games,  the  incentive  for  an  agent  to  join  a  coalition  grows  as  the 
coalition  grows. 

Theorem  4  Any  encounter  (Ti, . . . ,  T^)  in  a  concave  TOD  induces  a  convex  coalition  game 

{N,v). 

Theorem  5   [Shapley  (1971)]  [13]:  In  convex  coalition  games,  the  Shapley  Value  always 
satisfies  the  criterion  of  coalition  rationality. 

In  concave  TODs,  the  Shapley-based  mechanism  introduced  above  is  fully  stable,  i.e., 
satisfies  individual,  group,  and  coalition  rationality. 

The  cost  of  a  set  of  tasks  needs  to  be  calculated  only  a  linear  number  of  times. 
^The  definition  is  from    [21]. 
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6      The  Random  Permutation  Mechanism 

The  Shapley  Value  is  equal  to  the  expected  contribution  of  an  agent  to  the  full  coalition, 
assuming  that  all  possible  orders  of  agents  joining  and  forming  the  full  coalition  are  equally 
likely.  This  leads  us  to  a  much  simpler  mechanism  called  the  Random  Permutation  Mech- 
anism: agents  choose  a  random  permutation  and  form  the  full  coalition,  one  agent  after 
another,  according  to  the  chosen  permutation.  Each  agent  (i)  gets  utility  {wi)  that  is  equal 
to  its  contribution  to  the  coalition,  at  the  time  he  joined  it.  This  is  done  by  agreeing  on  the 
all-or-nothing  deal,  (pi, . . .  ,Pn),  such  that  p,  =  ''' JC*"' . 

Theorem  6  //  each  permutation  has  an  equal  chance  of  being  chosen,  then  the  Random 
Permutation  Mechanism  gives  each  agent  an  expected  utility  that  is  equal  to  its  Shapley 
Value. 

The  Shapley-based  Random  Permutation  Mechanism  does  not  explicitly  calculate  the 
Shapley  Value,  but  instead  calculates  the  cost  of  only  n  sets  of  tasks.  Therefore,  it  has  linear 
computational  complexity.  The  problem  of  coalition  formation  is  reduced  to  the  problem  of 
reaching  consensus  on  a  random  permutation. 

6.1      Consensus  on  Permutation 

No  agent  would  like  to  be  the  first  one  that  starts  the  formation  of  the  full  coalition  (since 
this  agent  by  definition  gets  zero  utility).  If  the  domain  is  concave  (and  therefore  the  coali- 
tion game  is  convex),  each  agent  has  an  incentive  to  join  the  coalition  as  late  as  possible. 
To  ensure  stability,  we  need  to  find  a  consensus  mechanism  that  is  resistant  to  any  coali- 
tion manipulation.  No  coalition  should  be  able,  by  coordination,  to  influence  the  resulting 
permutation  such  that  the  members  of  the  coalition  will  be  the  last  ones  to  join  the  full 
coalition.  For  example,  this  means  that  no  coalition  of  n  -  1  agents  could  force  the  single 
agent  that  is  out  of  the  coalition  to  go  first. 

We  will  use  the  simple  cryptographic  mechanism  that  allows  an  agent  to  encrypt  a  mes- 
sage using  a  private  key,  to  send  the  encrypted  message,  and  then  to  send  the  key  such 
that  the  message  can  be  unencrypted.  Using  these  tools,  each  agent  chooses  a  random  per- 
mutation and  a  key,  encrypts  the  permutation  using  the  key,  and  broadcasts  the  encrypted 
message  to  all  other  agents.  After  he  has  received  all  encrypted  messages,  the  agent  broad- 
casts the  key.  Each  agent  unencrypts  all  messages  using  the  associated  keys.  The  consensus 
permutation  is  the  combination  of  all  permutations. 

Each  agent  can  make  sure  that  each  permutation  has  an  equal  chance  of  being  chosen  even 
if  he  assumes  that  the  rest  of  the  agents  are  all  coordinating  their  permutations  against  him 
(i.e.,  trying  to  make  him  be  the  first).  All  he  needs  to  do  is  to  choose  a  random  permutation. 
Since  his  permutation  will  also  be  combined  into  the  final  permutation,  everything  will  be 
shuffled  in  a  way  that  no  one  can  predict. 


11 


7      Conclusions 

We  have  considered  the  kinds  of  n-agent  coordination  mechanisms  that  can  be  used  in  Task 
Oriented  Domains  (TODs),  when  any  sub-group  of  agents  may  engage  in  task  exchange  to 
the  exclusion  of  others. 

We  presented  a  simple,  efficient,  symmetric,  and  individual  rational  Shapley  Value-based 
coalition  formation  mechanism  that  uses  cryptographic  techniques  for  subadditive  TODs. 
When  the  domain  is  also  concave,  the  mechanism  also  satisfies  coalition  rationality. 

Future  research  will  consider  non-subadditive  TODs.  It  will  also  consider  issues  of  in- 
centive compatibility  in  multi-agent  coalition  formation,  investigating  mechanisms  that  can 
be  employed  when  agents  have  partial  information  about  the  goals  of  other  group  members 
and  can  deceive  one  another  about  this  private  information. 
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